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Trypanosoma cruzi affects millions of people worldwide. Clinical variability of Chagas disease can be due to the genetic variabil- 
ity of this parasite, requiring further genome studies. Here we report the genome sequence of the T. cruzi Dm28c clone (Tel), a 
strain related to the sylvatic cycle of the parasite. 
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Trypanosoma cruzi is a protozoan parasite that causes Chagas 
disease, an illness that affects 8 to 10 million people in Latin 
America. The genetic variability found in the parasite population 
allows classification of strains into six major groups (Tel through 
TcVI) (1). In 2005, the T. cruzi CL Brener clone genome was 
published (2), but its complete assembly was hindered by repeated 
sequences and the hybrid nature of this strain. Other genomes, 
such as the Sylvio X10/1 strain, have been published (3), allowing 
comparative analyses to assess the variable clinical forms of Cha- 
gas disease. Whereas CL Brener (TcVI), together with the Tell and 
TcV strains, is the predominant cause of Chagas disease in the 
Southern Cone of Latin America (1, 2), Sylvio X10/1 and Dm28c 
belong to Tel, a group related to the sylvatic cycle that is also 
associated with Chagas disease in the Brazilian Amazon, Venezu- 
ela, Colombia, and Central America (3). Here, we present the 
genome sequence of the T. cruzi clone Dm28c (4), which has been 
widely used as a model for biochemistry, cell biology, immunol- 
ogy, gene expression, and drug development studies. 

Sequencing was carried out using the 454 GS-FLX Titanium 
platform. Two shotgun libraries and one 3-kb paired-end (PE) 
library were constructed using ±5 ;u.g of Dm28c DNA isolated 
from epimastigote cultures. Library construction, titration, emul- 
sion PCR, and sequencing steps were performed according to the 
manufacturer's protocol. Assembly of the Dm28c genome was 
carried out by processing the 454 SFF files using custom Perl 
scripts to generate paired-end (PE) FASTQ files. The SFF files were 
subsequently assembled using the Newbler software and the out- 
put was scaffolded using SSPACE 2.1.0. Gaps were improved by 
use of GapFiller 1.11 prior to automated annotation. Using this 
protocol, a total number of 5,287 contigs were generated, and 
91.7% of them (4,848) were assembled into 1,378 scaffolds, total- 
ing 30,716,540 bp. The longest contig was 89,747 bp. 

Automated functional annotation was performed de novo us- 
ing the System for Automated Bacterial Integrated Annotation 
(SABIA) (5). We identified 4,144 proteins with homology to 
known proteins from other organisms as well as 7,254 hypotheti- 



cal proteins, with a coding sequence (CDS) average length of 
1,397 bp. With an estimated haploid genome size of 27 Mbp and a 
G+C content of 53.26%, Dm28c is more similar, as expected, to 
Sylvio X10/1 (98.71% nucleotide sequence identity) than to CL 
Brener (90.20%), as revealed by a stringent Megablast analysis of 
both coding and noncoding regions. Accordingly, the Bidirec- 
tional Best Hit (BBH) analysis showed that DM28c has 6,094 pro- 
teins with best hits with SylvioX-10 and 5,267 proteins with best 
hits with CL Brener. The assembled genome together with the 
functional annotation is available on http://www.labinfo.lncc.br 
/index.php/trypanosoma-cruzi-dm28. 

Nucleotide sequence accession number. The T. cruzi Dm28c 
genome sequence is available in GenBank with the accession num- 
ber AYLP00000000. 
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